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Abstract 

We analyse the mixing time of Markov chains using path couphng with stopping times. We 
apply this approach to two hypergraph problems. We show that the Glauber dynamics for 
independent sets in a hypergraph mixes rapidly as long as the maximum degree A of a vertex 
and the minimum size m of an edge satisfy m > 2A + 1. We also show that the Glauber dynamics 
for proper g-colourings of a hypergraph mixes rapidly if m > 4 and q > A, and if m = 3 and 
q > 1.65 A. We give related results on the hardness of exact and approximate counting for both 
problems. 



1 Introduction 

We develop a new approach to using stopping times in conjunction with path coupling to bound the 
convergence of time of Markov chains. Our main interest is in applying these results to randomised 
approximate counting. For an introduction, see To illustrate our methods, we consider 

approximation of the numbers of independent sets and g-colourings in hypergraphs with upper- 
bounded degree, and lower-bounded edge size. These problems in hypergraphs are of interest in 
their own right but, while approximate optimisation has received attention [711^ 1181 122j . there has 
been surprisingly little work on approximate counting. 

Our results are achieved by considering, in the path coupling setting, the stopping time at which 
the distance between two coupled chains first changes. The first application of stopping times to 
path coupling was by Dyer, Goldberg, Greenhill, Jerrum and Mitzenmacher Their analysis 

was later improved by Hayes and Vigoda ^7j) using a method closely related to that developed 
in this paper. Theorem 12. H the main technical result of the paper, shows that if the expected 
distance between the two chains has decreased at this stopping time, then the chain mixes rapidly. 
This also follows from |171 Corollary 4]. However we give a simpler proof than that of 17[, and our 
Theorem 12 . II will usually give a moderate improvement in the bound on mixing time in comparison 
with 117, Corollary 4]. See Remark 12.31 below. 

The problem of approximately counting independent sets in graphs has been widely studied, see for 
example [HI ^1 1221 I2H] > but the only previous work on the approximate counting of independent 
sets in hypergraphs seems to that of Dyer and Greenhill [^. They showed rapid mixing to the 
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uniform distribution of a simple Markov chain on independent sets in a hypergraph with maximum 
degree 3 and maximum edge size 3. However, this was the only interesting case resolved. Their 
results imply rapid mixing only for m < A/(A — 2), which gives m < 3 when A = 3 and m < 2 
when A > 4. In Theorem 13. II we prove rapid mixing of the Glauber dynamics for any hypergraph 
such that m > 2 A + 1, where m is the smallest edge size and A is the maximum degree. This 
is a marked improvement for large m. More generally, we consider the hardcore distribution on 
independent sets with fugacity A. (See, for example, ^lUnillHIO In S2]i it is proved that rapid 
mixing occurs if A < m/{{m — 1)A — m). Here we improve this considerably for larger values of 
m, to A < (m — 1)/2A. We also give proofs that computing the number of independent sets in 
hypergraphs is ^^P-complete except in trivial cases, and that there can be no approximation for 
the number of independent sets in a hypergraphs if the minimum edge size is at most logarithmic 
in A. It may be noted that our upper and lower bounds are exponentially different. We have no 
strong belief that either is close to the threshold at which approximate counting is possible, if such 
a threshold exists. 

Counting g-colourings of hypergraphs was considered by Bubley PI , who showed that the Glauber 
dynamics was rapidly mixing if g > 2A, generalising a result of Jerrum [20] and Salas and Sokal 
for graphs. Much work has been done on improving this result for graph colourings, see [S] and 
its references, but little attention appears to have been given to the hypergraph case. Here we 
prove rapid mixing of Glauber dynamics for proper colourings of hypergraphs if m > 4, g > A, 
and if m = 3, q> 1.65 A. For a precise statement of our result see Theorem 15.31 Again we give 
proofs that computing the number of colourings in hypergraphs is ^^P-complete except in trivial 
cases, and that there can be no approximation for the number of colourings of hypergraphs if 
q<{l- l/m)Ai/(™-i). A gain, there is a considerable discrepancy between the upper and lower 
bounds for large m. 

The paper is organised as follows. Section [l.ll gives an intuitive motivation for the stopping time 
approach of the paper. Section [21 contains the full description and proof of Theorem 12. II for path 
coupling with stopping times. We apply this to hypergraph independent sets in Section|21 Section^ 
contains the hardness proofs. Section El contains analysis of the Glauber dynamics for hypergraph 
colouring. Finally, Section |21 contains the hardness results for counting colourings in hypergraphs. 

1.1 Intuition 

Let Ti. = (V, £) be a hypergraph of maximum degree A and minimum edge size m. A subset 5 C V 
of the vertices is independent if no edge is a subset of S. Let Vl{'H) be the set of all independent 
sets of 7i. Let A be the fugacity, which weights independent sets. (See |E1-) The most important 
case is A = 1, which weights all independent sets equally and gives rise to the uniform distribution 
on all independent sets. We define the Markov chain M{'H) with state space Vl{'H) by the following 
transition process {Glauber dynamics). If the state of M at time t is Xt, the state at t + 1 is 
determined by the following procedure. 

(i) Select a vertex f G V uniformly at random, 

(ii) (a) \iv (^Xt let Xt+i = Xt\{v} with probability 1/(1 + A), 

(b) \i V ^ Xt and Xt U {v} is independent, let Xt-\-i = XtU {v} with probability A/(l + A), 

(c) otherwise let Xt+i = Xt. 
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This chain is easily shown to be ergodic with stationary probabihty proportional to A'^' for each 
independent set I Q V. In particular, A = 1 gives the uniform distribution. The natural coupling 
for this chain is the "identity" coupling, the same transition is attempted in both copies of the 
chain. If we try to apply standard path coupling to this chain, we immediately run into difficulties. 
Consider two chains Xt and Yt such that Yt = Xt L) {w}, where w ^ Xt (the change vertex) is of 
degree A. An edge e £ £ is critical in Yj if it has only one vertex z £ V which is not in Yj, and we 
call z critical for e. If each of the edges through w is critical for Yt, then there are A choices of v in 
the transition which can be added in Xt but not in Yj. Thus, if A = 1, the change in the expected 
Hamming distance between Xt and Yt after one step could be as high as ^ — ^- Thus we obtain 
rapid mixing only in the case A = 2. This case has some intrinsic interest, since the complement of 
an independent set corresponds, under hypergraph duality, to an edge cover ^HI in a graph. Thus 
we may uniformly generate edge covers, but the scope for unmodified path coupling is obviously 
severely limited. 

The insight on which this paper is based is as follows. Although in one step it could be more 
likely that a bad vertex (increasing Hamming distance) is chosen than a good vertex (decreasing 
Hamming distance), it is even more likely that one of the other vertices in an edge containing w is 
chosen and removed from the independent set. Once the edge has two unoccupied vertices other 
than w, then any vertex in that edge can be added in both chains. This observation enables us to 
show that, if T is defined to be the stopping time at which the distance between Xt and Yt first 
changes, the expected distance between Xt and Yt will be less than 1. Theorem 12 . II b elow shows 
that under these circumstances path coupling can easily be adapted to prove rapid mixing. 

Having established this general result, we use it to prove that M{7i) is rapidly mixing for hyper- 
graphs with m > 2AA + 1. Note that, though all the results in this paper will be proved for uniform 
hypergraphs of edge size m, they carry through trivially for hypergraphs of minimum edge size m. 

2 Path coupling using a stopping time 

First we prove the main result discussed above. 

Theorem 2.1. Let M he a Markov chain on state space Let d he an integer valued metric 
on 0, X Q, and let (Xf, Yj) he a path coupling for A4, where S is the set of pairs of states {X,Y) 
such that d{X,Y) = 1. For any initial states {Xq,Yo) € S let T be the stopping time given by the 
minimum t such that d{Xt, Yt) ^ 1. Suppose, for some p > 0, that 

(i) Pr(T = t\T > t) > p, independently for each t, 
(ii) B[d{XT,YT)] <a<l. 

Then A4 mixes rapidly. In particular the mixing time r(e) of M satisfies 

r{e) < ^-J—\n{eD2)ln(^^), 
p 1 - a Ve(l - q)/ 

where Di = max{d(X, Y):X,Yen} and D2 = max{d(XT, Yt) : Xq, Yo e Q, d{Xo, Yq) = 1}. 

Proof. Consider the following game. In each round a gambler either wins £1, loses some amount 
£{l — 1) or continues to the next round. If he loses £{l — 1) in a game, he starts / separate (but 
possibly dependent) games simultaneously in an effort to win back his money. If he has several 
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games going and loses one at a certain time, he starts I more games, while continuing with the 
others that did not conclude. We know that the probability he finishes a game in a given step is at 
least p, and the expected winnings in each game is at most 1 — a. The question is: does his return 
have positive expectation at any fixed time ? We will show that it does. But first a justification for 
our interest in this game. 

Each game represents a single step on the path between two states of the coupled Markov chain. 
We start with Xq and Yq differing at a single vertex. The first game is won if the first time the 
distance between the coupled chains changes is in convergence. The game is lost if the distance 
increases to Z. At that point we consider the distance I path Xt to Yt, and the I games played 
represent the I steps in the path. Although these games are clearly dependent, they each satisfy 
the conditions given. The gambler's return at time t is one minus the length of the path at time t, 
so a positive expected return corresponds to an expected path length less than one. We will show 
that the expected path length is sufficiently small to ensure coupling. 

First note that the gambler's return at time t is one minus the number of games active at time 
t. For the initial game we define the level to be zero, for any other possible game we define the 
level to be one greater than the level of the game whose loss precipitated it. We define the random 
variables Mf;,ljk and Ijk{t) as follows. is the number of games at level k that are played, Ijk, 
for j = 1 . . . Mk, is the number of games in level k + 1 which are started as a result of the outcome 
of game j in level k, and Ijk{t) is an indicator function which takes the value 1 if game j in level k 
is active at time t, and otherwise. Let N(t) be the number of games active at time t. Then, by 
linearity of expectations, 

k=0 



(1) 



We will bound this sum in two parts, splitting it at a point k = K tohe determined. For k < K 
we observe that Mk < -Df- Since Pr(/jfc(t) = 1) is at most the probability that exactly k — 1 games 
of a sequence are complete at time i, regardless of outcome, we have 



E 

So that 



Mk 



< maxE[7jfe(t)] < -D2 Pi'lsxactly A; — 1 games complete by time t). 
j 



K 



k=0 



Mk 



K 



< E Z»f Pr(exactly k-1 games complete by time t) 



fc=o 



< Z)2^Pr(at most K games complete by t). 
On the other hand, for k > K we observe that 



(2) 



E 



Mk 



Mk- 



Y^Ijkit) < E[Mfe] = Em,_i [E[Mfc|Mfe_i]] = Em,_i[E[E Z,-fc_i|Mfe_i]] 
■ j=i -I j=i 

Since E[Zjfc_i] < a for any starting conditions, we may apply this bound even when conditioning 
on Mfe_i. So 



E 



Mk 

3=1 



< E[aMfc_i] < a'' 



(3) 
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using linearity of expectation, induction and E[Mi] < a. Putting ((21) and Q together we get 

oo 

E[iV(t)] < Pr(at most K games complete by t) + ^ 

k=K+\ 

K a^+^ 

= D2 Pr(at most K games complete by t) + . (4) 

1 — Q 

We now set K = [(Ina)""*^ ln(^^:^^)J, hence the final term is at most e/2Di. The probability that 
a game completes in any given step is at least p. If we select a time t > c/p for K + \ > 1, then 
the probability that at most K games are complete is clearly maximised by taking this probability 
to be exactly p in all games. Hence, by Chernoff's bound (see, for example, |191 Theorem 2.1]), 

K 

-k 



E[iV(r)]< ^2^E(^)p'(1 
fc=o ^ ^ 



2Di 

< K\nD2+K-c/2 . ^ 

2Di' 

Choosing c = 2A'ln(eL>2) +2 In we obtain E[N{t)] < ^, where r = [^(Sf • 

We conclude that the gambler's expected return at time r is positive. More importantly, for any 
initial states Xq, Yq ^1, the expected distance at time r is at most e by linearity of expectations, 
and so the probability that the chain has not coupled is at most e. The mixing time claimed now 
follows by standard arguments. See, for example, j21j . □ 

Remark 2.2. The assumption that the stopping time occurs when the distance changes is not 
essential. We clearly cannot dispense with assumption (ii), or we cannot bound mixing time. 
Assumption (i) may appear a restriction, but appears to be naturally satisfied in most applications. 
It seems more natural than the assumption of bounded stopping time, used in 17^- Assumption (i) 
can easily be replaced by something weaker, for example by allowing p to vary with time rather 
than remain constant. Provided p ^ sufficiently often, a similar proof will be valid. 

Remark 2.3. Let 7 = 1/(1 — a). It seems likely that D2 will be small in comparison to 7 in most 
applications, so we might suppose D2 < 7 < Di. The mixing time bound from Theorem 12.11 can 
then be written 0{p~^ j log D2log(Di/e)). We may compare this with the bound which can be 
derived using |17| Corollary 4]. This can be written in similar form as 0(p^^7 log 7 log(Di/e)). In 
such cases we obtain a reduction in the estimate of mixing time by a factor log7/logD2- In the 
applications below, for example, we have D2 = 2 and 7 = 0(A), so the improvement is r2(log A). 

Remark 2.4. The reason for our improvement on the result of is that the use of an upper bound 
on the stopping time, as is done in will usually underestimate the number of stopping times 
which occur in a long interval, and hence the mixing rate. 



3 Hypergraph independent sets 

We now use the approach of path coupling via stopping times to prove that the chain discussed in 
Section Fl. II is rapidly mixing. The metric used in path coupling analyses throughout the paper will 
be Hamming distance between the coupled chains. We prove the following theorem. 
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Theorem 3.1. Let A, A be fixed, and let Ti he a hypergraph such that m > 2AA + 1. Then the 
Markov chain A4{7i) has mixing time 0(n log n). 

Before commencing the proof itself, we analyse the stopping time T for this problem. 
3.1 Edge Process 

Let Xt and Yt be copies of Ai which we wish to couple, with Yq = Xq U {w}. Let e be any edge 
containing w, with m = \e\. We consider only the times at which some vertex in e is chosen. The 
progress of the coupling on e can then be modelled by the following "game". We will call the 
number of unoccupied vertices in e (excluding w) units. At a typical step of the game we have k 
units, and we either win the game, win a unit, keep the same state or lose a unit. These events 
happen with the following probabilities: we win the game with probability 1/m, win a unit with 
probability at least (m — A; — 1)/(1 + A)m, lose a unit with probability at most Xk/{1 + X)m and 
stay in the same state otherwise. If ever k = 0, we are bankrupt and we lose the game. Winning 
the game models the "good event" that the vertex v is chosen and the two chains couple. Losing 
the game models the "bad event" that the coupling increases the distance to 2. We wish to know 
the probability that the game ends in bankruptcy. We are most interested in the case where k = 1 
initially, which models e being critical. Note that the value of k in the process on hypergraph 
independent sets dominates the value in our model, since we can always delete (win in the game), 
but we may not be able to insert (lose in the game) because the chosen vertex is critical in some 
other edge. 

Let Pk denote the probability that a game is lost, given that we start with k units. We have the 
following system of simultaneous equations. 

(m — 1 + 2A)pi — (m — 2)p2 = A 
-kXpk-i + {m - k + {k + l)X)pk - {m - k - l)pk+i = (/c = 2, 3, . . . , m - 1) (5) 

Adding the equations in (jSJ from the k^^ onwards gives 

(m - l)pi + mXpm-i = X 
-kXpk^i + (m - k)pk + mXpm-i = (A; = 2, 3, . . . , m - 1). (6) 

Now is equivalent to ©, since we have simply multiplied the coefficient matrix of © by an 
upper triangular matrix with all entries 1. This transformation is clearly nonsingular. We will show 
by induction that @ has solution 

Pk = l^l^^ {k = l,2,...,m-l). (7) 

When fc = 1, the first equation in © is clearly satisfied by (jT)). Assume by induction that (O is 
true for Pk-i, with k >2. Then 
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Xk Am 

Pk = rPk-i rPrn-l 

m — k m — k 

^ Xk A^-i-E-=i (T)A''Wi Am 

m — k Ck i) m — k ^"^ ^ 

continuing the induction. For consistency, we must clearly have 

\m-l _ fm\ \rn-i^ 

Pm-1 — fm-l\ ~ "UJ^ + '^J -i-A )Pm-l, 

\m-l) 

^m— 1 

1-e. Pm-1 = _^ _ (8) 
Using 0, can be rewritten 

= ^(a^- ^^=iUJA \ 2..=.+ajA (A; = l,2,...,m-1). (9) 

(V)V (1 + A)™-A- ; ((i + A)--A-)(V) 

In particular 

A / mA"^-! \ , , 



3.2 The expected distance between Xt and 1^ 

The stopping time for the pair of chains Xt and Yt will be when the distance between them changes, 
in other words either a good or bad event occurs. The probability that we observe the bad event 
on a particular edge e with w G e is at most as calculated above. Let denote the number of 
empty vertices in e at time t when the process is started with = k. Now (^t can never reach 
without first reaching k — 1 and, since the process is Markovian, it follows that 

Pk = Pr{3tCt = OlCo = k) = Pr(3t6 = 0|6 = k- l)Pr(3s6 = k- l\Co = k) < pt-i- 

Since w is in at most A edges, the probability that we observe the bad event on any edge is at 
most Api. The probability that the stopping time ends with the good event is therefore at least 
1 — Api. The path coupling calculation is then 

E[d(XT,lT)] < 2Api. 

This is required to be less than 1 in ordered to apply Theorem 12.11 If m > 2AA + 1, then by (|1U|) 

(2AA + 1)A^^^ 

2Api-l - ^-^^^^2AA+l_A2AA+r 
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Proof of Theorem \'j.l\ The above work puts us in a position to apply Theorem 12.11 Let m > 
2AA + 1. Then for M{n) we have 



(i) Vv{d{XuYt) + l|d(Xt_i,yt_i) = 1) > ^ for all t, and 

(ii) E[d(XT,yT)] < 1 - ^^'^.^aXi 1^1^^^^^^ 

Also for Al('H) we have D\ = n and D2 = 2. Hence by Theorem 12.11 M{7i) mixes in time 

(1 + A)2^^+l - A2^^+1 / _i (1 + A)2^^+l - A2^^+1 

r(e) < 6n — -— . . In I ne 



(2AA + 1)A2AA V (2AA + 1)A2A^ 

This is 0(n log n) for fixed A, A. □ 

Remark 3.2. In the most important case, A = 1, we require m > 2A + 1. This does not include 
the case m = 3, A = 3 considered in p[2| • We have attempted to improve the bound by employing 
the chain proposed by Dyer and Greenhill in |12| Section 4]. However, this gives only a marginal 
improvement. For large AA, we obtain convergence for m > 2AA + ^ + o(l). For A = 1, this gives 
a better bound on mixing time for m = 2A + 1, with dependence on A similar to Remark 13.31 
below, but does not even achieve mixing for m = 2A. We omit the details in order to deal with 
the Glauber dynamics, and to simplify the analysis. 

Remark 3.3. The terms in the running time which are exponential in A, A would disappear if we 
instead took graphs for which m > 2AA + 2. In this case the running time would be 

T(e) < 6(2AA + l)nln(ne"^(2AA + 1)) < 12(2AA + l)nln(ne~^). 

Furthermore, if we took graphs such that m > (2 + (5)AA, for some 6 > 0, then the running time 
would no longer depend on A, A at all, but would be r(e) < csnln{ne^^) for some constant cs- 

Remark 3.4. It seems that path coupling cannot show anything better than m linear in AA. Suppose 
the initial configuration has edges {w, vi, . . . , Vm-2,Xi} for i = 1, . . . , A, with w, vi, . . . , Vm-2 G Xq, 
xi, . . . , XA ^ -'^o and w the change vertex. Consider the first step where any vertex changes state. 
Let fj, = (1 + A) (m — 1 + A) . The good event occurs with probability (1 + A)//i, insertion of a critical 
vertex with probability XA/fi, and deletion of a non-critical vertex with probability (m— 1)/^- We 
therefore need (m — 1) + (1 + A) > AA, i.e. m > A(A — 1), to show convergence by path coupling. 

Remark 3.5. It seems we could improve our bound m > 2AA + 1 for rapid mixing of the Glauber 
dynamics somewhat if we could analyse the process on all edges simultaneously. Examination of 
the extreme cases, where all edges adjacent to w are otherwise independent, or where they are 
dependent except for one vertex (as in Remark EH}, suggests that improvement to (1 + o(l))AA 
may be possible, where the o(l) is relative to AA. However, the analysis in the general case seems 
difficult, since edges can intersect arbitrarily. 



4 Hardness results for independent sets 

We have established that the number of independent sets of a hypergraph can be approximated 
efficiently using the Markov Chain Monte Carlo technique for hypergraphs with edge size linear 
in A. We show next that exact counting is unlikely to be possible, and that our approximation 
scheme cannot be extended to cover all hypergraphs with edge size r2(logA). 
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4.1 Tv'^P-completeness 



We show that the exact counting problem is T^P-Complete except in trivial cases. 

Theorem 4.1. Let G{m,A) be the class of uniform hypergraphs with minimum edge size m > 3 
and maximum degree A. Computing the number of independent sets of hypergraphs in Q(m,A) is 
#P- complete i/A > 2. //A < 1, it is in P. 



Proof. Since m is the minimum edge size, we will assume m > 3. The cases A = 0, 1 are trivially 
in P. As discussed in Section 11.11 independent sets in a hypergraph with A = 2 correspond to 
edge covers in a graph. Counting these is ^^P-complete, even for graphs with arbitrarily large 
minimum degree. This is stated in |H] but without proof, so we provide a proof in Appendix 1X1 
We now consider A > 3. (The case m = A = 3 is discussed in |12j.) Take a graph G = {V,E), 
and construct a hypergraph Q = {V,£) by "extending" each edge e = {vi,V2} € to an edge 
e+ = |tii,uf, . . . ,u'^_2,V2] G £■ Observe that, for each independent set I of G and edge e & E, 
there are 2™""^ — 1 independent assignments to uf, . . . , u^„2 '^^ vi,V2 & I and 2"*"^ otherwise. This 
is equivalent to evaluating the partition function of a weighted H -colouring problem EI on G, 
with weight matrix 

"2^—2 2™~2 
^ 2™~2 2"^~2 2 

The T^P-completeness of //-colouring with this weight matrix follows either directly from [3] or 
indirectly from |131 Corollary 3.2]. The degree bound A = 3 follows from |13| Theorem 5.1], on 
noting that A is nonsingular. □ 



4.2 Approximation hardness 

We now show that unless NP=RP, there can be no fpras for the number of independent sets of all 
hypergraphs with edge size r2(log A). 

Let G = {V, E), with \V\ = n, be a graph with maximum degree A and Ni independent sets of size 
i (i = 0, 2, . . . , n). For A > let Zg{X) = X^ILo -^i-^* define the hard core partition function. The 
following is a combination of results in Luby and Vigoda 23 and Berman and Karpinski pp. 

Theorem 4.2. If \ > 694/A, there is no fpras for Zg{X) unless NP=RP. 

Proof. Let e be a constant such that the size of the largest independent set in a graph of maximum 
degree 4 cannot be approximated to within a ratio (1 + e) unless P = NP. Berman and Karpinski ^ 
show that e > 1/49. Luby and Vigoda [231 Theorem 4] prove the hardness of approximating Zg{\) 
if A > c/A for any c > 20 In 2 (1 + £)/s.^ Together, these two results give the theorem. □ 

We note that Theorem 14. 21 could probably be strengthened using the approach of However, this 
has yet to be done. 

Theorem 4.3. Unless NP=RP, there is no fpras for counting independent sets in hypergraphs 
with maximum degree A and minimum edge size m < 21g(l + A/694) — 1 = Q(log A). 

^The expression in |28| omits the In 2 term 
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Proof. Given a graph G = {V, E) with maximum degree A, we construct a hypergraph 7i = (V, £) 
as follows. Let k = \m/2\. For each v ^V, let Wy = w^2) • • • iW^k} and V = Ungy Fo'^ 
each edge e = {«, € -E, let Se = U W^, and let £" = {Se : e € -E}. It is clear that Ti has 
maximum vertex degree A and every edge has size 2k > m. 

An independent set T in 7^ corresponds to a unique independent set / in G as follows. If C I, 
then V £ I, otherwise v ^ I. Clearly Z independent in Ti implies / independent in G. Note that for 
each V ^ I, there are {2^ — 1) possible subsets of Sy which may be in 2. Thus, if Af is the number 
of independent sets in H, 

n n 

M = Y^Ni{2'' - 1)"-^ = (2^= - l)"^iVi(2'= - 1)-^ = (2^ - 1)"Zg(1/(2'= - 1)). 

i=0 i=Q 

Thus approximating M is equivalent to approximating Zg(A) with A = 1/(2^^ — 1). But, by The- 
orem lOl this win be hard if 1/(2'' - 1) > 694/A. This gives k < lg(l + A/694), which holds 
whenever m < 21g(l + A/694) - 1. □ 



5 Hypergraph colouring 

We now consider Glauber dynamics on the set of proper colourings of a hypergraph. Again our 
hypergraph Ti. will have maximum degree A, minimum edge size m, and we will have a set of q 
colours. A colouring of the vertices of 7i is proper if no edge is monochromatic. Let ^}'(Ti.) be the 
set of all proper g-colourings of H. We define the Markov chain C{H) with state space VL'iTi) by 
the following transition process. If the state of C at time t is X^, the state at f + 1 is determined by 

(i) selecting a vertex t; G V and a colour k € {1, 2, . . . , g} uniformly at random, 

(ii) let X[ be the colouring obtained by recolouring v colour k 

(iii) if X[ is a proper colouring let Xt+i = X[ 
otherwise let Xt+i = Xt. 

This chain is easily shown to be ergodic with the uniform stationary distribution. Again we will 
use Theorem 12. II to prove rapid mixing of this chain under certain conditions, however first we will 
examine the chain using standard path coupling techniques. 

Theorem 5.1. For m > 4, g > A, the Markov chain C{7i) mixes in time 0(n log n). 

Proof. Suppose that two copies of C(H), Xq and Yq say, start at distance one apart, i.e. they differ 
in only one vertex w. Suppose that the number of colours available for recolouring w is q — k, then 
the probability of the two copies of the chain coupling in one step is The distance between 

the two chains can only increase (to 2) if we select a vertex v and recolour it with a colour that is 
permitted in one copy of the chain only. For this to happen, there must be an edge containing v and 
w such that the other vertices in this edge are all either red and we have chosen red for v, or blue 
and we have chosen blue for v. Hence there can be at most one vertex on each edge, and one colour 
for that vertex, such that the chains diverge if we select that vertex and colour. Furthermore, for 
each of the k unavailable colours there must be an edge containing w which, apart from w itself, 
is monochromatic in the forbidden colour, so on these edges there are no vertices whose selection 
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can cause the chains to diverge. Hence the probabihty that the distance increases to 2 in one step 
is at most The path couphng calculation is therefore 

E[d(Xi,yi)]<i-i^ + ^. 

nq nq 

If g > A + 1 then E[d(Xi, li)] < 1 — ^/nq, and therefore by the path coupling theorem the mixing 
time is 

r(e) < nqln{ne^^). ^ 



This analysis leaves little room for improvement in the case m > 4, indeed it is not clear whether 
the Markov chain described is even ergodic for q < A. The following simple construction does 
show that the chain is not in general ergodic if q < ^ + 1. Let q = ^ + 1, and take a hypergraph 
7i on q{m — 1) vertices. We will group the vertices into q groups V = Vi, V2, ■ ■ ■ ,Vq, each of size 
m — 1. Then the edge set oi Ti. is E = {{v} LiVj : v £ V,v ^ Vj}. The degree of each vertex is 
{q — l) + {q — l)(m — 1) = A. If we now colour each group Vj a different colour, we obtain ql distinct 
colourings, but for each of these the Markov chain is frozen (no transition is valid). 

The case m = 2 is graph colouring and has been extensively studied. See, for example, [S]. This 
leaves the case m = 3, hypergraphs with 3 vertices in each edge. The standard path coupling 
argument, as in Theorem 15. 11 only shows rapid mixing for q > 2 A, since there may be two vertices 
in each edge that can be selected and lead to a divergence of the two chains. This occurs if, of the 
two vertices in an edge which are not w, one is coloured red and the other blue. However, we can 
do better using Theorem 12.11 We will need the following technical Lemma. 

Lemma 5.2. Let ip{d) = 1 - d{l - e-(9-^+'^)*/Afg)/(g _ a + d). For all t > and all d > 1, 

Proof. Let n = q — A>0, x = t/Mq > 0. We wish to show that 

V'(x) = ip{d) - ip{l)'^ = l-d{l- e-^''+'^^^)/{K + d) - (1 - (1 - e-(''+i)^)/(K + l))'^ > 0. 

Since ^(0) = 0, it suffices to show that ip{x) is increasing for all 2; > 0. But 

V''(x) = -(ie-(''+'^)^ + de-('^+^)^(l-(l-e-('"+^)^)/(K + l))'^"^ 

= de-('^+^)"((l - (1 - e-(''+i)")/(K + 1))''"' - e-^'^-^)"), 

so it suffices to show 1 - (1 - e-(''+i)^)/(/t + 1) > e"^. Let C{x) = 1 - e"^ - (1 - e-(''+i)^)/(K + 1). 
Then ^(0) = 0, so we need only show that C{x) is increasing for all x G (0, 00). But 

('(x) = e"^ - e-^'^+i)^' = e-^(l-e-''^) > 0, 

for X > 0. □ 



Theorem 5.3. There exists Aq such that, if 7i is a 3-uniform hypergraph with maximum degree 
A > Aq and q > 1.65 A, the Markov chain C(7i) mixes rapidly. 
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Proof. We choose Aq large enough that all the approximations below are valid. We couple two 
copies of this chain using the identity coupling. Let X and Y be two copies of C(7i) such that Xq 
and Yq differ only at a single vertex w. As before, we will examine the stopping time T at which 
d{XT,YT) 7^ 1 for the first time, and show that E[d(XT,lr)] < 1- We assume that w is coloured 
blue in Xq and red in Yq. We will call any other colour neutral. Let T{w) denote the set of vertices 
of 7i that share an edge with w. We will only consider transitions in which either w or a vertex in 
T(w) is selected, since any transition which involves any other vertex will not change the distance 
between X and Y. Let M = |r(it;)| + 1. We will first assume that none of the edges containing 
w is otherwise monochromatic, and hence that all colours are available for rccolouring w. We will 
deal with other cases later. Let St denote the event that T = t and d(Xr, Yt) = 0, which we will 
call success. The bad event we will call failure. 

The probability that the two chains couple in any one step is For each v G r(tt;), let Py^t 

be an indicator variable which takes value 1 if u is either red or blue after t steps of the chain, 
and takes value otherwise. We describe a choice of vertex v e T{w) and colour c G {red,blue} 
at step t as 'bad' if there is an edge containing v and w whose other vertex is currently coloured 
c, and let Bt denote the number of bad choices at time t. The probability of failure in step t is 
therefore For each v G T(w) let be the number of edges which contain both v and w. Then 
Bt < I]„er(«;) dvPv,t- Now, using ^ to imply equality up to a factor 1 + oa(1), 



Pr{St) = E 



J-J- \ M MqJ M 



'-j=0 



> — E 
- M 



2^j=0\M' Mq > 



M 



-E 



i-ij=Q^M~ Mql 



-t/M 



M 



-E 



[n 



dw v^t — 1 a 
g Mq l^j=0 P'^ 



(11) 



We will now study the properties of fd^^t, with a view to analysing E[e '^i ^i=o'^'"- ]. Note that the 
probability has not coupled or diverged by time 40A is at most 

(1 - 1/M)20^ < (1 - l/2A)^o^ < e-2° < 10-^ 

so we consider times only up to 40A. Let = max{f : t < 40A and Pv,t = 1}- If v starts out 
either red or blue, the probability that it is recoloured to a neutral colour in each step is at least 
(g — A — 2)/Mq. Also, the probability that it becomes red or blue before time 40A is at most 
80A/Mg. Hence 



Pr(i^ >t)< 1 



g- A-2 
Wq 



+ 



80A 
Wq 



q-A, 
Mq ' 



since the second term is 0(1/ A) and small compared to the first, which is 0(1) for f < 40 A. Now 

■13= 



we can bound X]^=o fiv,t by the minimum of t and t„, an exponentially distributed random variable 
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with parameter We are in a position to bound E[e z.j=o'^''.*j g^g follows. 



E[e E - jA.t] > ^Pr(t^ = j)e "^'i^ + Pr(t^ > t)e 

3=0 



Mq 



q - A .aziA 



Mq 

1 -^^ — - 1 - e 

g - A + V 



Inserting this into Equation we get 



where '^{d) was defined in Lemma [5. 21 Since Ti^^Y{w)dv = 2A, Lemma [5.21 implies that for all t > 0, 

\2A 



n '/'(4)>^(i) 

t;er(«)) 



Hence, for f < 40A, 



E[Pr(s.)l > — (i-j3aTtO-="^)) 

p;;;; P M q — A ^ ' 

M 

Finally, noting that Pr(d(Xx,l^) = 0) = Yl'u=o^'^{^t) by linearity of expectation, we have 
Pr(d(XT,yr) = 0) > j M^'^'"^ 



. 40A 





If we now substitute q = 1.65A and M < 2A, we see that 

Pr(d(XT,yT)=0) > / e-^-3.077(l-e-o.394i.)^^ ^ 

Since d{XT,YT) G {1,2}, it follows that E[d(XT,lT)] < 0.9994 and we can apply Theorem O 
This yields the claimed result. 

We have assumed that all colours are available for recolouring w at every step. This will not be 
the case if there is any edge e adjacent to w for which e \ {w} is monochromatic. Let us call such 
an edge blocking, and suppose there are pt blocking edges at time t. Note that the failure cannot 
occur on a blocking edge. The total number p' of blocking edges created during time 40A is at 
most In A since 

Pr(p'>lnA) < 1 < -il =o _L 



InAjKqJ ~ VlnA/ VAt 
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for every constant 7 > 0. Since In A is negligible in comparison with q and M, these do not affect 
the probability estimates in the proof above. Thus we may assume that all blocking edges exist 
initially. We may further assume these persist until termination, so pt = po = p for all t. This 
can only decrease the probability of success. We now observe that this is no worse in our analysis 
than taking A' = A — p and q' = q — P- Let us formally define M' = Mq/q' . Then the conditional 
success probability is {q — p)/Mq = 1/M', and the recolouring probability at each step is at least 
(g — A)/Mq = (q' — A')/M'q'. The analysis now proceeds as before. Since M' plays no part in 
the final condition, we finally require q' > 1.65A', i.e. q > 1.65A — 0.65p. This is clearly a weaker 
condition than q > 1.65 A. □ 

Remark 5.4. If we let (3 = {q — A)/q then, as Aq — > 00, the analysis can be tightened slightly to 
work for (3 > (3* , where /3* is the root of the equation 



The integral can be expanded, by parts integration, as an infinite series to give an alternative 
equation 

fe^n}=o(i + j/?) 

This has root /3* = 0.392729, giving q > 1.64671. 

Remark 5.5. A route to improving our bound on q would be to consider the changes in the numbers 
of colours available at each vertex of T{w) during the process. We make the pessimistic assumption 
that this is always q — A but, while this could be true initially, we would expect more colours to 
become available later on. A proper analysis of this effect seems more difficult, however, because 
0(A^) vertices are now involved, and the edges containing them may intersect. 



6 Hardness results for colouring 

6.1 T^P-completeness 

Again we show that exact counting is #P-complete except in the few cases where it is clearly in P. 
Let Q{m,A) be as in Theorem 14. II 

Theorem 6.1. Computing the number of q- colourings of hypergraphs in G{m,A) is #P-complete 
if A,q > 1. If A < 1 or q < 1 it is in P. 

Proof. Again we assume m > 3. The cases A < 1, q < 1 are trivially in P. The case A = 2 
corresponds to counting edge g'-colourings of graphs in which no vertex is monochromatic. We 
call an edge colouring with no monochromatic vertex a weak edge colouring. Counting weak edge 
colourings is #P-complete for graphs of arbitrarily large minimum degree. We give a proof in 
Appendix IbI 

For A > 3, g > 2, we use the construction from the proof of Theorem 14.11 For each colouring 
X : y — > {1, 2, . . . , of G and edge e G E, there are _ ^ permitted colourings of uf, . . . , ^^-2 
if X{vi) = X{v2) and (7™^^ otherwise. The corresponding ff-colouring problem has the following 
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q y- q weight matrix: 



A 



q- - - I q- 

gm—2 gm—2 



^m-2 



^m-2 



^m-2 



^m-2 



1 



The #P-completeness of //-colouring with this weight matrix, and the bound A = 3, follow as in 
Theorem 14.11 since A is again nonsingular. □ 



6.2 Hardness of Approximation 

Again let ^(m, A) be as defined in Theorem 14.11 Our result, Corollary 16.31 follows directly from 
the following NP-completeness proof. 

Theorem 6.2. Determining whether a hypergraph in Q{m,A) has any q-colouring is NP-complete 
for any m > 1 and 2 < q < {1 — l/m)A^/^"^~^\ 

Proof. If m = 2, this is graph colouring, and the result follows from I14| Theorem 1.4]. (See 
also j25j.') For m > 3, we use the following reduction from graph colouring. Let G = {V,E) be a 
graph with degree Aq, and 2 < q < 3Ag'/4. Without loss, we may assume Aq = [4g/3]. Colouring 
G with q colours is NP-complete For each edge e = {^1,^2} £ E, let = {ufi,uf2^ ■ ■ ■ I'^im) 
[i = 1,2,... ,g) and Vg = Vjl^iSf. Let £q comprise all subsets of Vq of size m other than Sf 
{i = 1,2, . . . , q). We claim that any proper q-colouring of the hypergraph Ti.Q = (Vo,<f^o) must 
assign the same colour to all ufj £ Sf {j = 1,2, ... ,m), and a different colour for each i = 1, . . . , q. 
The claim holds since there must be some colour class of size at least m, since there are q colours 
and mq vertices. If there was a colour class of size greater than m, at least one of its subsets of size 
m would be a monochromatic edge. Thus there must be exactly q colour classes, each of size m. If 
these are not the Sf {i = 1,2, ... ,q), again there is a monochromatic subset of size m which is an 
edge. Clearly, by symmetry, any assignment of the q colours to the q classes Sf is permissible. 

Let = VoU{fi,t'2}, and add the edges {vi,uf2, ■ ■ ■ ,Uim} {i = ^1 ■ ■ ■ 1 Lq/^J) and {v2,ul2, ■ ■ ■ ,Uim} 
{i = [q/2\ -|- 1, . . . , g) to to give We claim that, in any proper g-colouring of the hypergraph 
Ti.^ = {V^,£^), vi and V2 must receive different colours. The claim holds since vi can have any 
colour different from all Sf {i = 1,2,..., [q/2\), and V2 any colour different from all Sf {i = 
[g/2j +1, . . . ,q). But these permitted colour sets for vi and V2 are disjoint. Also, given any colours 
for vi and V2, there are \_q/2\\q/2~\{q — 2)\ > colourings of 7i^. Thus we may use TL^ to simulate 
the edge e £ E. Thus we set V = Uegs"'^'^' ^ ~ Uee-E'^'^ ^^'^ consider the hypergraph 7i = {V,£). 
Then 7i is g-colourable if and only if G is g-colourable. 

The maximum degree in TC of any Uij is (J^^) > eg"^~^. The degree in Ti of each w G y is at 
most AG\q/2] < (4g + 2){q + l)/6 < 2q^ . Thus A = > {mq/{m - l))™-\ and hence 

g < (1 - l/m)Ai/(™-i). □ 

Corollary 6.3. Unless NP=RP, there is no fpras for counting q-colourings of a hypergraphs with 
maximum degree A and minimum edge size m if 2 < q < {1 — l/m)A^/(''"^^) . 

Proof. We cannot tell if there is any colouring for q in this range, so there can be no fpras. □ 
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Remark 6.4. It is clearly a weakness that our lower bound for approximate counting is based 
entirely on an NP-completeness result. However, we note that the same situation pertains for 
graph colouring, which has been the subject of more intensive study. 

7 Conclusions 

We have presented an approach to the analysis of path coupling with stopping times which improves 
on the method of in most applications. Our method may itself permit further development. 

We apply the method to independent sets and (7-colourings in hypergraphs with maximum degree 
A and minimum edge size m. In the case of independent sets, there seems scope for improving the 
bound m > 2 A + 1, but anything better than m > A + o(A) would seem to require new methods. 
For colourings, there is probably little improvement possible in our result q > A for m > 4, but 
many questions remain for m < A. For example, even the ergodicity of the Glauber (or any other) 
dynamics is not clearly established. For the most interesting case, m = 3, the bound q > 1.65A 
(for large A) can almost certainly be reduced, but substantial improvement may prove difficult. 

Our T^P-completeness results seem best possible for both of the problems we consider. On the 
other hand, our lower bounds for hardness of approximate counting seem very weak in both cases, 
and are far from our upper bounds. These lower bounds can probably be improved, but we have 
no plausible conjecture as to what may be the truth. 
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Appendices 

A Edge cover is #P-complete 

Proof. We prove this by reduction from counting independent sets, using methods similar to Bubley 
and Dyer [3], where this result was claimed without proof. Let G i>e a class of 3-regular graphs for 
which counting independent sets is #P-complete '16' Theorem 3.1]. Let G = (y,E), with Ij{G) 
independent sets of size j {j = 0, 1, . . . ,n). Form G' by subdividing each edge e £ E with a new 
vertex Ue. Let U = {ue : e G E}. Let Ni{G') be the number of edge sets in G' which leave exactly 
i vertices in V uncovered, but no vertex in U. In particular, Nq{G') is the number of edge covers 
of G', and we assume an oracle computing this quantity. Observe that the uncovered vertices in G' 
must form an independent set in G. Then it follows, similarly to [31, that 



Thus, if we can determine the Ni{G'), we can determine the number of independent sets of all 
sizes in G. Let Nij{G') be the number of edge sets of G' in which i vertices in V and j in U are 
uncovered (i = 0, . . . , n, j = 0, . . . , 3n/2). Then Ni{G') = Nio{G'). We attach a copy of Km 
to each vertex v G V and a copy -ftT^ of to each vertex u G U. Call the resulting graph Gmk- 
Let Mm be the number of edge covers of Km, then Mm-i is the number of edge sets in Km which 
leave a fixed vertex uncovered. We can show by inclusion-exclusion that 




i=j 
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(See 'TT^-) It is easy to show that that Mm/Mm-i is a rapidly increasing sequence (in fact 
Mm/Mm-i ~ 2™"^ for large m), and hence has a different value for every value of m. We have 

n 3n/2 

NoiG'mk) = E E MiiMm + Mm-ir~'Mi{Mk + Mk-lf^/^^' N^jiC) . (12) 
i=0 j=0 

n / ,^ \ n~i 3n/2 , , 3n/2-j 

By choosing any (n + 1) values of m and any (3n/2 + 1) values of k, we can determine all the 
Nij{G') by interpolation, and hence all the Ni{G'). From these, we can determine all the Ij{G), 
and hence X]j=i ^ji^)^ the total number of independent sets in G. 

Note that the minimum degree of G^^, min{m, k} — 1, can be made as large as we wish. □ 

B Weak edge colouring is #P-complete 

Proof. We use the same notation and construction as in Appendix^ Now ^ is a class of 3-regular 
graphs for which vertex g-colouring is T^P-complete Theorem 2.2]. Let Ni{G') denote the 
number of edge colourings of G' with i monochromatic vertices, so Nq(G') is the number of weak 
edge colourings of G' , and we assume an oracle for this. Let Nij{G') be the number edge colourings 
of G' in which i vertices in V and j in U are monochromatic (i = 0, . . . , n, j = 0, . . . , 3n/2). Now 
observe that NnQ{G') is equal to the number of proper vertex g-colourings of G, Qq{G) say. In 
every colouring counted in Nno{G'), every vertex is monochromatic and adjacent vertices receive 
different colours. Again we attach a copy of Km to each vertex v G V, and a copy Kj! of Kk 
to each vertex u G [/, to give Gmk- Let Mm be the number of weak colourings of Km, and Mm the 
number of edge colourings of Km with a given monochromatic vertex. Now we have 

/ \ ^ / \ / — \ m— 1 / ^\ , — — 1\ 

Mm = ^^^^^B-in")^ ' = 

i=l ^ ^ ^ 1=0 \ * / 

Again the sequence Mm/ Mm increases rapidly {Mm/ Mm ~ q^~'^ for large m), and takes a different 
value for every m when q >2. Now, as in H12|). 

Hence, choosing (n + 1) values of m and (3n/2 + 1) values of k, we can determine all the Nij{G') 
by interpolation. In particular, we can determine A^„,o(G") = Qq{G). 

Again the minimum degree of G^^, min{m, /c} — 1, can be made arbitrarily large. □ 
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